Cross-Language Ontology Alignment Utilizing Machine Translation Models
Abstract
In ontology alignment, linguistic analysis is a prominent technique used by many proposed methodologies. When mapping ontologies that use the same language, existing approaches have been shown to produce significant results and can handle complex descriptions of the enclosed concepts and properties. To extend these linguistic methods to a cross-language context, i.e. to align ontologies that use different languages, it is essential to automate the process of finding lexical correspondences, beyond simple term translation, between the entity descriptions provided by the involved ontologies. The present paper proposes a machine learning approach that selects the optimal translation from a set of translations provided by different machine translation services, and uses it as the basis for aligning ontology pairs whose complex descriptions are expressed in different languages.
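As a rough illustration of the selection step described in the abstract, the sketch below picks, from several candidate translations of one entity description, the candidate that best matches a label in the target ontology. It is not the paper's method: the abstract does not specify the learning model, so a plain token-overlap (Jaccard) score stands in for the learned ranker, and the function names `select_translation`, `jaccard` and `tokens`, as well as the sample strings, are hypothetical.

```python
import re


def tokens(text):
    """Lowercase a string and return its set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def jaccard(a, b):
    """Token-level Jaccard similarity between two strings (0.0 to 1.0)."""
    ta, tb = tokens(a), tokens(b)
    if not ta and not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def select_translation(candidates, target_labels):
    """Return (candidate, label, score) for the best-scoring pair.

    `candidates` are translations of one source description, e.g. one per
    machine translation service; `target_labels` are descriptions taken
    from the target ontology. Returns None if either list is empty.
    NOTE: Jaccard overlap is an assumed stand-in for a trained model.
    """
    best = None
    for cand in candidates:
        for label in target_labels:
            score = jaccard(cand, label)
            if best is None or score > best[2]:
                best = (cand, label, score)
    return best


# Hypothetical candidate translations of a single source-language label.
candidates = ["author of the paper", "paper writer", "writer of article"]
# Hypothetical labels from the target ontology.
target_labels = ["paper author", "conference review", "program committee member"]

best = select_translation(candidates, target_labels)
print(best)
```

In a learned variant, the similarity score would be replaced by the output of a model trained on features of each candidate translation, with the selection loop left unchanged.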
Similar resources
Linking Knowledge Graphs across Languages with Semantic Similarity and Machine Translation
Knowledge graphs and ontologies underpin many natural language processing applications, and to apply these to new languages, these knowledge graphs must be translated. Up until now, this has been achieved either by direct label translation or by cross-lingual alignment, which matches the concepts in the graph to another graph in the target languages. We show that these two approaches can, in fa...
Cross-Lingual Ontology Mapping - An Investigation of the Impact of Machine Translation
Ontologies are at the heart of knowledge management and make use of information that is not only written in English but also in many other natural languages. In order to enable knowledge discovery, sharing and reuse of these multilingual ontologies, it is necessary to support ontology mapping despite natural language barriers. This paper examines the soundness of a generic approach that involve...
Twisted Pair Grammar: Support for Rapid Development of Machine Translation for Low Density Languages
An English-to-Turkish Interlingual MT System; Rapid Prototyping of Domain-Specific Machine Translation Systems; Time-Constrained Machine Translation; An Evaluation of the Multi-Engine MT Architecture; An Ontology-Based Approach to Parsing Turkish Sentences; Monolingual Translator Workstation; Fast Document Translation for Cross-Language Information Retrieval...
Sentence Alignment by Means of Cross-Language Information Retrieval
In this chapter, we focus on the specific problem of sentence alignment given two comparable corpora. This task is essential to some specific applications such as parallel corpora compilation (Utiyama & Tanimura, 2007) and cross-language plagiarism detection (Potthast et al., 2009). We address this problem by means of a cross-language information retrieval (CLIR) system. CLIR deals with the probl...
Statistical Alignment Models for Translational Equivalence
The ever-increasing amount of parallel data opens a rich resource to multilingual natural language processing, enabling models to work on various translational aspects like detailed human annotations, syntax and semantics. With efficient statistical models, many cross-language applications have seen significant progress in recent years, such as statistical machine translation, speech-to-speec...